PACKING AND PARTITIONING ORBITOPES 



VOLKER KAIBEL AND MARC E. PFETSCH 

Abstract. We introduce orbitopes as the convex hulls of 0/1-matrices 
that are lexicographically maximal subject to a group acting on the 
columns. Special cases are packing and partitioning orbitopes, which 
arise from restrictions to matrices with at most or exactly one 1-entry 
in each row, respectively. The goal of investigating these polytopes is to 
gain insight into ways of breaking certain symmetries in integer programs 
by adding constraints, e.g., for a well-known formulation of the graph 
coloring problem. 

We provide a thorough polyhedral investigation of packing and parti- 
tioning orbitopes for the cases in which the group acting on the columns 
is the cyclic group or the symmetric group. Our main results are com- 
plete linear inequality descriptions of these polytopes by facet-defining 
inequalities. For the cyclic group case, the descriptions turn out to be 
totally unimodular, while for the symmetric group case, both the de- 
scription and the proof are more involved. The associated separation 
problems can be solved in linear time. 



1. Introduction 

Symmetries are ubiquitous in discrete mathematics and geometry. They 
are often responsible for the tractabihty of algorithmic problems and for the 
beauty of both the investigated structures and the developed methods. It 
is common knowledge, however, that the presence of symmetries in integer 
programs may severely harm the ability to solve them. The reasons for this 
are twofold. First, the use of branch-and-bound methods usually leads to an 
unnecessarily large search tree, because equivalent solutions are found again 
and again. Second, the quality of LP relaxations of such programs typically 
is extremely poor. 

A classical approach to "break" such symmetries is to add constraints 
that cut off equivalent copies of solutions, in hope to resolve these problems. 
There are numerous examples of this in the literature; we will give a few 
references for the special case of graph coloring below. Another approach 
was developed by Margot [ITJ[12]. He studies a branch-and-cut method that 
ensures to investigate only one representative of each class of equivalent solu- 
tions by employing methods from computational group theory. Furthermore, 
the symmetries are also used to devise cutting planes. Methods for symme- 
try breaking in the context of constraint programming have been developed, 
for instance, by Fahle, Schamberger, and Sellmann [7] and Puget |16| . 
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The main goal of this paper is to start an investigation of the polytopes 
that are associated with certain symmetry breaking inequahties. In order to 
clarify the background, we first discuss the example of a well-known integer 
programming (IP) formulation for the graph coloring problem. 

Let G = {y, E) be a loopless undirected graph without isolated nodes. 
A (vertex) coloring of G using at most C colors is an assignment of colors 
{!,... , C} to the nodes such that no two adjacent nodes receive the same 
color. The graph coloring problem is to find a vertex coloring with as few 
colors as possible. This is one of the classical NP-hard problems [9]. It is 
widely believed to be among the hardest problems in combinatorial opti- 
mization. In the following classical IP formulation, 1/ = {1, . . . ,n} are the 
nodes of G and C is some upper bound on the number of colors needed. 

c 



mm 



Xij + Xkj<yj {i,k} £ E, j e {I,... ,C} (i) 



(1) 



c 

"^Xij = l ieV (ii) 

j=i 

Xi,G{0,l} i£V, j G{1,...,C} (iii) 
y,G{0,l} iG {!,..., C} (iv) 

In this model, variable Xij is 1 if and only if color j is assigned to node i and 
variable yj is 1 if color j is used. Constraints (i) ensure that color j is assigned 
to at most one of the two adjacent nodes i and k; it also enforces that yj 
is 1 if color j is used, because there are no isolated nodes. Constraints (ii) 
guarantee that each node receives exactly one color. 

It is well known that this formulation exhibits symmetry: Given a solution 
{x,y), any permutation of the colors, i.e., the columns of x (viewed as an 
n X C-matrix) and the components of y, results in a valid solution with the 
same objective function value. Viewed abstractly, the symmetric group of 
order C acts on the solutions {x, y) (by permuting the columns of x and the 
components of y) in such a way that the objective function is constant along 
every orbit of the group action. Each orbit corresponds to a symmetry class 
of feasible colorings of the graph. Note that "symmetry" here always refers 
to the symmetry of permuting colors, not to symmetries of the graph. 

The weakness of the LP-bound mentioned above is due to the fact that 
the point {x*,y*) with x*^ = 1/C and y* = 2/C is feasible for the LP 
relaxation with objective function value 2. The symmetry is responsible 
for the feasibility of (x*,y*), since x* is the barycenter of the orbit of an 
arbitrary x G {0, l}"^*^ satisfying (ii) in ([T]). 

It turned out that the symmetries make the above IP-formulation for the 
graph coloring problem difficult to solve. One solution is to develop differ- 
ent formulations for the graph coloring problem. This line has been pursued, 
e.g., by Mehrotra and Trick [13], who devised a column generation approach. 
See Figueiredo, Barbosa, Maculan, and de Souza [8] and Cornaz [5] for al- 
ternative models. 

Another solution is to enhance the IP-model by additional inequalities that 
cut off as large parts of the orbits as possible, keeping at least one element of 
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each orbit in the feasible region. Mendez-Dlaz and Zabala [15] showed that a 
branch-and-cut algorithm using this kind of symmetry breaking inequalities 
performs well in practice. The polytope corresponding to ([T]) was investi- 
gated by Campelo, Correa, and Prota [3] and Coll, Marenco, Mendez-Diaz, 
and Zabala j^. Ramani, Aloul, Markov, and Sakallah [T^ studied symme- 
try breaking in connection with SAT-solving techniques to solve the graph 
coloring problem. 

The strongest symmetry breaking constraints that Mendez-Diaz and Za- 
bala [131 US] introduced are the inequalities 

i-l 

Xij — Xkj-i < 0, for all i and j > 2. (2) 

k=l 

From each orbit, they cut off all points except for one representative that 
is the maximal point in the orbit with respect to a lexicographic ordering. 
A solution {x, y) of the above IP-model is such a representative if and only 
if the columns of x are in decreasing lexicographic order. We introduce a 
generalization and strengthening of Inequalities ^ in Section 14.11 

Breaking symmetries by adding inequalities like ((2|) does not depend on the 
special structure of the graph coloring problem. These inequalities single out 
the lexicographic maximal representative from each orbit (with respect to the 
symmetric group acting on the columns) of the whole set of all 0/1-matrices 
with exactly one 1-entry per row. The goal of this paper is to investigate 
the structure of general "symmetry breaking polytopes" like the convex hull 
of these representatives. We call these polytopes orbitopes. The idea is that 
general knowledge on orbitopes (i.e., valid inequalities) can be utilized for 
different symmetric IPs in order to address both the difficulties arising from 
the many equivalent solutions and from the poor LP-bounds. In particular 
with respect to the second goal, for concrete applications it will be desirable 
to combine the general knowledge on orbitopes with concrete polyhedral 
knowledge on the problem under investigation in oder to derive strengthened 
inequalities. For the example of graph coloring, we indicate that (and how) 
this can be done in Section [3 Figure [1] illustrates the geometric situation. 

The case of a symmetric group acting on the columns is quite important. 
It does not only appear in IP-formulations for the graph coloring problem, 
but also in many other contexts like, e.g., block partitioning of matrices [1], 
/c-partitioning in the context of frequency assignment [6], or line-planning 
in public transport p]. However, other groups are interesting as well. For 
instance, in the context of timetabling in public transport systems [H], cyclic 
groups play an important role. 

We thus propose to study different types of orbitopes, depending on the 
group acting on the columns of the variable-matrix and on further restrictions 
like the number of 1-entries per row being exactly one (partitioning), at most 
one (packing), at least one (covering), or arbitrary (full). 

The main results of this paper are complete and irredundant linear de- 
scriptions of packing and partitioning orbitopes for both the symmetric group 
and for the cyclic group acting on the columns of the variable-matrix. We 
also provide (linear time) separation algorithms for the corresponding sets of 
inequalities. While this work lays the theoretical foundations on orbitopes. 
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Figure 1: Breaking symmetries by orbitopes. The left figure illustrates an orbitope, i.e., 
the convex hull of the representatives of a large system of orbits. For a concrete problem, 
like graph coloring, only a subset of the orbits are feasible (the dark orbits). Combining 
a (symmetric) IP-formulation for the concrete problem with the orbitope removes the 
symmetry from the formulation (right figure). 

a thorough computational investigation of the practical usefulness of the re- 
sults will be the subject of further studies (see also the remarks in SectionE]). 

The outline of the paper is as follows. In Section [2], we introduce some 
basic notations and define orbitopes. In Section ETT] we show that optimiza- 
tion over packing and partitioning orbitopes for symmetric and cyclic groups 
can be done in polynomial time. In Section Owe give complete (totally uni- 
modular) linear descriptions of packing and partitioning orbitopes for cyclic 
groups. Section m deals with packing and partitioning orbitopes for symmet- 
ric groups, which turn out to be more complicated than their counterparts for 
cyclic groups. Here, besides (strengthenings of) Inequalities ([2]), one needs 
exponentially many additional inequalities, the "shifted column inequalities", 
which are introduced in Section 14.21 We show that the corresponding sepa- 
ration problem can be solved in linear time, see Section [43l Section l44l gives 
a complete linear description, and Section [451 investigates the facets of the 
polytopes. We summarize the results for symmetric groups in Section 14.61 
for easier reference. Finally, we close with some remarks in Section [5l 

2. Orbitopes: General Definitions and Basic Facts 

We first introduce some basic notation. For a positive integer n, we define 
[n] := {1,2,..., n}. We denote by the 0-matrix or 0- vector of appropriate 
sizes. Throughout the paper let p and q be positive integers. For x E IR.H^I'J] 
and 5* C [p] X [g], we write 



For convenience, we use S — {i,j) for S\{{i,j)} and S+{i,j) for 5u{(i,j)}, 
where 5* C [p] x [q] and (i, j) € [p] x [q]. lip and q are clear from the context, 
then rowj := {{i, 1), (i, 2), . . . , (i, q)} are the entries of the ith row. 

Let Mp^g := {0, be the set of 0/1-matrices of size px q. We define 

o M^q := {x G Mp^q : x(rowj) < 1 for all i} 
o Mp q := {x G Mp^q : x(rowj) = 1 for all i} 
o M^q := {x G Mp^q : x(rowj) > 1 for all i}. 
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Let -< be the lexicographic ordering of Mp,q with respect to the ordering 

(1,1) < (1,2) < ••• < (1,(?) < (2,1) < (2,2) < ••• < (2,g) < ••• < {p,q) 

of matrix positions, i.e., A ^ B with A = (uij), B = {pij) € Mp,q if and only 
if aki < bki, where {k,i) is the first position (with respect to the ordering 
above) where A and B differ. 

Let &n be the group of all permutations of [n] (symmetric group) and let G 
be a subgroup of &q, acting on M.p.g by permuting columns. Let A4™g^(G) 
be the set of matrices of A^p,g that are -(-maximal within their orbits under 
the group action G. 

We can now define the basic objects of this paper. 

Definition 1 (Orbitopes). 

(1) The full orbitope associated with the group G is 

Op,,(G) := conv A^^,^"(G). 

(2) We associate with the group G the following restricted orbitopes: 

Of g{G) := conv{M'l]^''{G) n 7W|g) (packing orbitope) 
Op giG) := conv(A^™g''(G) n Mp g) (partitioning orbitope) 
Of^qiG) ■■= conv{M'^q^{G) n Mf q) (covering orbitope) 

Remark. By definition, O^g(G) is a face of both O^g(G) and O^g(G). 

In this paper, we will be only concerned with the cases of G being the 
cyclic group (tq containing all q cyclic permutations of [q] (Section [3l) or the 
symmetric group 6q (Section Hj). Furthermore, we will restrict attention 
to packing and partitioning orbitopes. For these, we have the following 
convenient characterizations of vertices: 

Observation 1. 

(1) A matrix of Mp^q is contained in A^™g^(Sq) if and only if its columns 
are in non-increasing lexicographic order (with respect to the order -< 
defined above). 

(2) A matrix of A4^g is contained in Al™g^(Cg) if and only if its first column 
is lexicographically not smaller than the remaining ones (with respect to 
the order -< ). 

(3) In particular, a matrix of M.p ,j is contained in M^^^{'tq) if and only if 
it has a 1-entry at position (1, 1). 

2.1. Optimizing over Orbitopes 

The main aim of this paper is to provide complete descriptions of 0^g(©g), 
0^g{&q), Op^q{^q), and ©^^(e!^) by systems of linear equations and linear 
inequalities. If these orbitopes admit "useful" linear descriptions then the 
corresponding linear optimization problems should be solvable efficiently, due 
to the equivalence of optimization and separation, see Grotschel, Lovasz, and 
Schrijver [TO] . 

We start with the cyclic group operation, since the optimization problem 
is particularly easy in this case. 
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Theorem 1. Both the linear optimization problem over A4™g^(^g) n M-^^q 
and over M.^g^{(tq) H Mp^q can be solved in time 0{pq). 

Proof. We first give tiie proof for tlie pacliing case. 

For a vector c G Q^^l^t'^l, we consider the linear objective function 

p q 

(c, x) := ^ ^ ^ ^ Cij Xij. 
i=i j=i 

The goal is to find a matrix A* G Al™^''(£<j) n M^q such that {c,A*) is 
maximal. Let A* be such a c-maximal matrix, and let a* € {0, 1}*' be its 
first column. If a* = 0, then ^4* = by Part ([2|) of Observation [H By 
the same observation it follows that if a* 7^ and € [p] is the minimum 
row-index i with a* = 1, then A* has only zero entries in its first i* rows, 
except for the 1-entry at position {i* , 1) (there is at most one 1-entry in each 
row). Furthermore, each row i > i* of A* either has no 1-entry or it has its 
(unique) 1-entry at some position where c is maximal in row i. 

Thus, we can compute an optimal solution as follows: (1) For each i G [p] 
determine a vector 6* G {0, 1}'' that is the zero vector if c does not have 
any positive entries in row i and otherwise is the j-th standard unit vector, 
where j G [q] is chosen such that Cij = max{cj£ : i G [q]}; set cjj := in 
the first case and cjj := Cij in the second. (2) Compute the values Sp := dp 
and Si := + Sj+i for all i = p — l,p — 2, . . . , 1. (3) Determine i* such that 
Ci*,i is maximal among {^,1 + Sj+i : i G [p]}. (4) If Ci*,i < 0, 

then is an optimal solution. Otherwise, the matrix whose i-th row equals 6* 
for i £ {i* + 1, . . . ,p} and which is all-zero in the first i* rows, except for a 
1-entry at position («*,1), is optimal. 

From the description of the algorithm it is easy to see that its running 
time is bounded by 0{pq) (in the unit-cost model). 

The partitioning case is then straightforward and even becomes easier due 
to Part dS]) of Observation [H □ 

Theorem 2. Both the linear optimization problem over ^A^q^{&q) fl M.^^q 
and over M^^^{&q) H A^^g can be solved in time 0{p^q). 

Proof. We give the proof for the partitioning case, indicating the necessary 
modifications for the packing case at the relevant points. 

As in the proof of Theorem [TJ we maximize the linear objective function 
given by (c, x) for c G Qt^l^I'^l. We describe a two-step approach. 

In the first step, for ^1,^2 G [p\ with ii < 12 and j G [<?], we let M {11,12., j) 
be c-maximal among the matrices in {0, l}^*i'*i+^'---'*2}^b1 with exactly (in 
the packing case: at most) one 1-entry in every row. Denote by /i(ii,i2ii) 
the c- value of M{ii,i2,j), i.e., 

«2 j 

I^{h,i2,j) = X] X]'^'=^^*^*i'^2,i)fcf • 
k=ii i=i 

The values fi{ii,i2,j) can be computed in time 0{p^q) as follows. First, 
we compute all numbers X{i,j) = max{cj£ : i G [j]} (in the packing case: 
= max(0, {cj£ : i G [j]})) for all i G [p] and j G [q]. This can clearly 
be done in 0{pq) steps by using the recursions X{i,j) = max{A(i, j — 1), Cij} 
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Figure 2: Illustration of the proof of Theorem [2] Left: Computation of ^{ii,i2,j). 
Right: Computation of r(j,j) via the dynamic programming relation ((S}. Indicated are the 
matrix M(i, fc — 1, j — 1) and corresponding term ^(i,fc — — 1) and matrix r(fc + l,j + l) 
with corresponding term r(fc + l,j + 1). 



for j > 2. Then, after initializing fi{i,i,j) = X{i,j) for all i € [p] and 
j G [g], one computes /i(ii,i2,j) = fi{ii,i2 - 1, j) + A(i2,j) for all j G [q], 
ii = 1,2, . . . ,p, and ^2 = H + 1, ii + 2, . . . , g; see Figured 

In the second step, for « G [p] and j G [q], let T{i,j) be c-maximal among 
the matrices in {0, with exactly (in the packing case: at most) 
one 1-entry in every row and with columns j,j + l,...,q being in non- 
increasing lexicographic order. Thus, by Part ([1]) of Observation [U T(l,l) 
is an optimal solution to our linear optimization problem. Denote by r(i,j) 
the c-value of T(i,j), i.e., 

p <? 

Tihj) =^^CkeT{i,j)M. 

k=i 1=1 

Let k £ + . . . ,p+l} be the index of the first row, where T{i,j) has a 
1-entry in column j (with A: = p + 1 if there is no such 1-entry) ; see Figure [21 
Then T{i,j) has a c-maximal matrix T in rows A; -|- 1, . . . ,p with exactly (in 
the packing case: at most) one 1-entry per row and lexicographically sorted 
columns j + 1, . . . ,q (contributing r(A; -|- 1, j -|- 1)). In row k, there is a single 
1-entry at position {k,j) (contributing c^j). And in rows i, . . . ,k — 1, we 
have a c-maximal matrix M with exactly (in the packing case: at most) one 
1-entry per row in the first j — 1 columns (contributing /i(i, k—l,j — 1)) and 
zeroes in the remaining columns. Therefore, we obtain 

T{iJ) = fJ-{i,k- l,j - 1) +Ckj +T{k + + 1). 

Hence, considering all possibilities for k, we have 

T{i,j) =m.&-K{ ^i{i,k -I) + Ckj + T{k+l,j + 1) : (3) 

k£ {i,i + l,...,p + l}}, 

for all i G [p] and j G [q]. For convenience we define fi{ki,k2,0) = for 
ki,k2 G [p] with ki < ^2 and fj,{k,k — 1,£) = for all k G [p] and i G 
{0,1, ... ,q}. Furthermore, we set Cp+i^^ = for all i G [q]. Finally, we 
define r(p-F 2,^) =T{p+l,e) =T{k,q+i) = for all G [p] and £ G [q+1]. 

Thus, by dynamic programming, we can compute the table T{i,j) via 
Equation ([3|) in the order i = p,p — 1, . . . , 1, j = q,q — 1, . . . ,1. For each 
pair (i,j) the evaluation of ([3]) requires no more than 0(p) steps, yielding a 
total running time bound of 0{p^q). 



8 



KAIBEL AND PFETSCH 



Furthermore, if during these computations for each we store a maxi- 
mizer k{i,j) for A; in jS]), then we can easily reconstruct the optimal solution 
T(l, 1) from the fc-table without increasing the running time asymptotically: 
For i G [p], j G [q] the matrix T{i,j) is composed of M{i, k{i, j) — l,j — 1) 
(if k{i,j) > i + 1 and j > 2), T{k{i,j) + l,j + 1) (if k{i,j) < p - 1 and 
j < q — 1), and having 0-entries everywhere else, except for a 1-entry at 
position {k{i,j),j) (if k{i,j) < p). Each single matrix M{ii,i2,j) can be 
computed in 0((i2 — h)j) steps. Furthermore, for the matrices M{ii,i2,j) 
needed during the recursive reconstruction of T(l, 1), the sets {ii, . . . , «2}x [i] 
are pairwise disjoint (see Figure [2]). Thus, these matrices all together can be 
computed in time 0{pq). At the end there might be a single T{k, q + 1) to 
be constructed, which trivially can be done in 0{pq) steps. □ 

Thus, with respect to complexity theory there are no "obstructions" to 
finding complete linear descriptions of packing and partitioning orbitopes 
for both the cyclic and the symmetric group action. In fact, for cyclic group 
actions we will provide such a description in Theorem [3] and Theorem |4] for 
the partitioning and packing case, respectively. For symmetric group actions 
we will provide such a description for partitioning orbitopes in Theorems [16] 
and for packing orbitopes in Theorem [TTl The algorithm used in the proof 
of Theorem [T] (for cyclic groups) is trivial, while the one described in the 
proof of Theorem [2] (for symmetric groups) is a bit more complicated. This 
is due to the simpler characterization of the cyclic case in Observation [T] and 
is reflected by the fact that the proofs of Theorems [16] and [17] (for symmetric 
groups) need much more work than the ones of Theorems [3] and H] (for cyclic 
groups) . 

The algorithms described in the above two proofs heavily rely on the fact 
that we are considering only matrices with at most one 1-entry per row. 
For cyclic group operations, the case of matrices with more ones per row 
becomes more involved, because we do not have a simple characterization 
(like the one given in parts [2] and [3] of Observation [Tj) of the matrices in 
A^™g'^(e!q) anymore. For the action of the symmetric group, though we 
still have the characterization provided by Part ([T|) of Observation [T] the 
dynamic programming approach used in the proof of Theorem [2] cannot 
be adapted straight-forwardly without resulting in an exponentially large 
dynamic programming table (unless q is fixed). These difficulties apparently 
are refiected in the structures of the corresponding orbitopes (see the remarks 
in Section [5]). 

3. Packing and Partitioning Orbitopes for Cyclic Groups 

From the characterization of the vertices in parts ([2]) and ([3|) of Observa- 
tion [1] one can easily derive IP-formulations of both the partitioning orbitope 
O^g(C^q) and the packing orbitope 0^g(£g) for the cyclic group <tq. In fact, 
it turns out that these formulations do already provide linear descriptions of 
the two polytopes, i.e., they are totally unimodular. We refer the reader to 
Schrijver |18l Chap. 19] for more information on total unimodularity. 

It is easy to see that for the descriptions given in Theorems [3] and [4] below, 
the separation problem can be solved in time 0{pq). 
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Figure 3: Example of the coefficient vector for an inequality of type (dJ; "— " stands for 
a -1, "+" for a +1. 



Theorem 3. The partitioning orbitope Opq(Cq) for the cyclic group <tq 
equals the set of all x e IrH^IsI that satisfy the following linear constraints: 

o the equations xu = 1 and xij = for all 2 < j < q, 

o the nonnegativity constraints Xij > for all 2 < i < p and j G [q], 

o the row-sum equations x(rowj) = 1 for all 2 < i < p. 

This system of constraints is non-redundant. 

Proof. The constraints x{r:oWi) = 1 for i £ [p] and Xij > for i G € [q] 
define an integral polyhedron, since they describe a transshipment problem 
(and thus, the coefficient matrix is totally unimodular). Hence, the con- 
straint system given in the statement of the theorem describes an integer 
polyhedron, because it defines a face of the corresponding transshipment 
polytope. 

By Part ^ of Observation [H the set of integer points satisfying this con- 
straint system is A^^g n A^™g^(Cg). Hence the given constraints completely 
describe Op g{^g). The non-redundancy follows from the fact that dropping 
any of the constraints enlarges the set of feasible integer solutions. □ 

Theorem 4. The packing orbitope 0^g(£g) for the cyclic group <tq equals 
the set of all x G flW^W that satisfy the following linear constraints: 

o the constraints < xu < 1 and xij = for all 2 < j < q, 

o the nonnegativity constraints Xij > for all 2 < i < p and j G [q], 

o the row-sum inequalities x(rowj) < 1 for all 2 < i < p, 

o the inequalities 

q i—1 

-Yxki<G (4) 

j=2 k=l 

for all 2 < i < p (see Figure\3\for an example). 
This system of constraints is non-redundant. 

Proof. Prom Part ([2]) of Observation [T] it follows that an integer point is 
contained in 0^g{^q) if and only if it satisfies the constraints described in 
the statement, where Inequalities (|4| ensure that the first column of x is 
lexicographically not smaller than the other ones (note that we have at most 
one 1-entry in each row of x). Dropping any of the constraints enlarges 
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P2 P.-i P. Pi+i Pp 

Figure 4: The network matrix constructed in the proof of Theorem [D 

the set of integer solutions, which proves the statement on non-redundancy. 
Thus, as in the proof of the previous theorem, it remains to show that the 
polyhedron defined by the constraints is integral. We prove this by showing 
that the coefficient matrix A of the row-sum inequalities a;(rowi) < 1 (for 
2 < i < p) and Inequalities dl} (for all 2 < i < p) is a network matrix (and 
thus, totally unimodular). Adding the nonnegativity constraints amounts 
to adding an identity matrix and preserves total unimodularity, which also 
holds for the inclusion of xu < 1 into the system. 

In order to establish the claim on the network structure of A, we will 
identify a directed tree T, whose arcs are in bijection with [p] x [q] (the set 
of indices of the columns of A), such that there are pairs of nodes {vr,Wr) 
of T in bijection with the row indices r G [2(p — 1)] of ^4 with the following 
property. The matrix A has a (-l-l)-entry in row r and column if the 

unique path vr^ from node Vr to node Wr in the tree T uses arc (i,j) in its 
direction from z to j, a (— l)-entry, if vr^ uses in its reverse direction, 

and a 0-entry, if vr^ does not use 

For the construction of the tree T, we take a directed path Pi of length p 
on nodes {wii, ^21, ■ • • , I'p+i.i} with arcs an := (vj+i^i, Uji) for i G [p]; see 
Figure m For each 2 < i < p, we append a directed path Pi of length q — 1 to 
node Vii, where Pi has node set {vii,Vi2, ■ ■ ■ , Vig} and arcs Oij := {vij-i,Vij) 
for 2 < j < q. Choosing the pair (^;^+l^l, Ujg) for the i-th row sum-inequality 
and the pair (fii,Wig) for the i-th Inequality finishes the proof (using 
the bijection between the arcs of T and the columns of A indicated by the 
notation onj). □ 

4. Packing and Partitioning Orbitopes for Symmetric Groups 

For packing orbitopes 0^g{&q) and partitioning orbitopes O^g(Sg) with 
respect to the symmetric group it follows readily from the characterization 
in Part ^ of Observation [T] that the equations 

Xij = for all i < j (5) 
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are valid. Thus, we may drop all variables corresponding to components in 
the upper right triangle from the formulation and consider 

0|,(eg), 0=,(©,) C R^"-' with := {(ij) G \p] X [g] : i > j}. 

We also adjust the definition of 

rowj := {{i, 1), {i, 2), . . . , {i, mm{i, q})} for i € [p] 

and define the jth column for j € [q] as 

colj := {(i,i), (j + 1, j), • • • , 
Furthermore, we restrict ourselves to the case 

p>q>2 

in this context. Because of ([5]), the case of q > p can be reduced to the case 
p = q and the case of g = 1 is of no interest. 

The next result shows a very close relationship between packing and par- 
titioning orbitopes for the case of symmetric group actions. 

Proposition 5. The polytopes 0~q(Sq) and 0^_]^ g_]^(©g_i) are affinely 
isomorphic via orthogonal projection ofOp g{&g) onto the space 

C:={x e n^^'" : Xii = for all i e [p]} 



{and the canonical identification of this space with W-p 



-^r> —^.n — 



1,<7-1' 



Proof. The afhne subspace 

A:= {x e R-^P'" : 3;(rowi) = 1 for all i} 

of B!^p-i clearly contains 0^g(6q). Let tt : A B!^p-^'1-^ be the orthogonal 
projection mentioned in the statement (identifying C in the canonical way 
with IR-^p-^-''-^); note that the first row is removed since it only contains the 
element (1, 1). Consider the linear map (p '■ K,"^''"^'''"^ — > IR'^*''« defined by 

A( \ /l-y(rowi„i) ifj = l . 

(p{y)ij = s ^1 ■ fo'^ i^'-?) ^ 

[yi~i,j~i otherwise 

(where rowo = and y(0) = 0). This is the inverse of vr, showing that tt 
is an affine isomorphism. As we have 7r(0^g((3g)) = 0^_^ g_]^(Sg_i), this 
finishes the proof. □ 

It will be convenient to address the elements in Ip^g via a different "system 
of coordinates": 

ivJ) ■= {j + V- 1, j) for j G [q], l<r]<p-j + l. 

Thus (as before) i and j denote the row and the columns, respectively, while rj 
is the index of the diagonal (counted from above) containing the respective 
element; see Figure [5| | (a) [ for an example. For {k,j) = {r),j) and x G IR-^P'i , 
we write x^^j;) := a;(fc,j) := x^j. 

For X G {0, Ij^P'i we denote by := G Ip^q : Xij = 1} the set of 

all coordinates (positions in the matrix), where x has a 1-entry. Conversely, 
for I C 2p^q, we use € {0, ly-^P'i for the 0/1-point with xjj = 1 if and only 

if {i,j)el. 
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For G ^p,q, we define the column 

col(i,i) = (j + 1, j), •••,(«- ^ Jp^g, 

and for = (r/,j) we write col{r],j) := col(i,j). Of course, we have 

col(r?,j) ={(l,j),(2,j),...,(7/,j)}. 

The rest of this section is organized as follows. First, in Section |l?Tl we deal 
with basic facts about integer points in packing and partitioning orbitopes 
for the symmetric group. To derive a linear description of O^g(Sg) and 
Op ,j{&q) that only contains integer vertices, we need additional inequalities, 
the shifted column inequalities, which are introduced in Section [121 We then 
show that the corresponding separation problem can be solved in linear time 
(Section |4.3|1 . Section proves the completeness of the linear description 
and Section H3] investigates the facets of the polytopes. 

4.1. Characterization of Integer Points 
We first derive a crucial property of the vertices of 0^q(©g). 

Lemma 6. Let x be a vertex o/0^q(Sg) with {r],j) G {j > 2). Then we 
have n co\{rj,i — 1) / 0. 

Proof. With = we have Xij = 1, which implies a^ij-i = (since x 
has at most one 1-entry in row i). Thus, Pi col(r/, j — 1) = would yield 
Xk,j-i = for all k < i, contradicting the lexicographic order of the columns 
of X (see Part ^ of Observation [T|) . □ 

Definition 2 (Column inequality). For G Ip^q and the set B = 

{i, j + 1), . . . , {i, min{i, q})}, we call 

x{B) - x{co\{i - l,j - 1)) < 
a column inequality; see Figure [3 | (b )\ for an example with {i,j) = (9,5). 

The column inequalities are strengthenings of the symmetry breaking in- 
equalities 

Xij - x{col{i - 1, j - 1)) < 0, (6) 

introduced by Mendez-Diaz and Zabala [H] in the context of vertex-coloring 
(see (l2|) in the introduction). 

Proposition 7. A point x G {0, l}-^P'i is contained in O^q(Sg) (0^g(©q)) if 
and only if x satisfies the row-sum constraints x(row(z)) < 1 (x(row(z)) = 1) 
for all i \p] and all column inequalities. 

Proof. By Lemma EJ Inequalities are valid for O^g(Sq) (and thus, for 
its face Opg{&q) as well). Because of the row-sum constraints, all column 
inequalities are valid as well. Therefore, it suffices to show that a point 
X G {0, l}^P'i that satisfies the row-sum constraints x(row(z)) < 1 and all 
column inequalities is contained in ^A^^^^{&q). 

Suppose, this was not the case. Then, by Part ([T]) of Observation [H there 
must be some j G [q] such that the {j — l)-st column of x is lexicographically 
smaller than the j'th column. Let i be minimal with Xij = 1 (note that 
column j cannot be all-zero). Thus, x^j-i = for all k < i. This implies 
x(col(i - l,i - 1)) = < 1 — Xij, showing that the column inequality 
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(a) 



(b) 



(c) 



(d) 



Figure 5: (a) Example for coordinates (9, 5) = (5, 5). (b) (d) Shifted column inequalities 
with leader (5,5), see Definition^ All SCI inequalities are <-inequalities with right-hand 
sides zero and "— " stands for a (—1) -coefficient, "+" for a (+1) coefficient. The shifted 
column of Ois {(1, 2), (2, 3), (3, 3), (4, 4), (5, 4)}. 



x{B) — x{col{i — l,j — 1)) < is violated by the point x for the bar B = 
{hi + !),•••, («,mm{i,g})}. □ 

4.2. Shifted Column Inequalities 

Proposition [7] provides a characterization of the vertices of the packing- 
and partitioning orbitopes for symmetric groups among the integer points. 
Different from the situation for cyclic groups (see Theorems [3] and [4]) , how- 
ever, the inequalities in this characterization do not yield complete descrip- 
tions of these orbitopes. In fact, we need to generalize the concept of a 
column inequality in order to arrive at complete descriptions. This will yield 
exponentially many additional facets (see Proposition [T4l) . 

Definition 3 (Shifted columns). A set S = {(1, ci), (2, C2), ...,(??, c,,)} C 
Zp^q with rj > 1 and ci < C2 < • • • < is called a shifted column. It is a 
shifting of each of the columns 

co\{ri,Cr,),col{ri,Cr, + 1), . . .,col{r],q). 

Remark. 

o As a special case we have column col(i, j), which is the shifted column 
{(l,j),(2,j),--- AvJ)} for {r],j) = 

o By definition, if 5 = {(1, ci), (2, 02), ■ ■ ■ , {r], c,,)} C Ip^q is a shifted col- 
umn, then so is {(1, ci), (2, C2), . . . , {r]' , c^')} for every I < 1]' < t]. 

Lemma 8. Let x be a vertex of 0^g{6q) with {r],j) € {j > 2). Then we 
have n S ^ for all shiftings S of col{r],j — 1). 

Proof. We proceed by induction on j. The case j = 2 follows from Lemma El 
because the only shifting of col(?/, 1) is col(r/, 1) itself. Therefore, let j > 3, 
and let 5 = {(1, ci), (2, C2), . . . , {rj, c^)} be a shifting of col{r],j — 1) (hence, 
ci < C2 < ■ ■ ■ < < j — 1). Since by assumption {ri,j) G Lemma [6] yields 
that there is some r]' < rj with (r/',^ — 1) S ■ If — 1) € 5, then we are 
done. Otherwise, c^' < j — 1 holds. Hence, {(1, ci), (2, C2), . . . , (V) c^')} is a 
shifting of (col(7/',c^/) and hence of) col{r]',j — 2), which, by the inductive 
hypothesis, must intersect I^. □ 
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Figure 6: The two cases arising in the dynamic programming algorithm of Section 



Definition 4 (Shifted column inequalities). For = {r],j) € Ip,q, B = 
{(i, j), (i, j + 1), . . . , (i, min{i, q})}, and a shifting S of col(r/, j — 1), we call 

x{B) - x{S) < 

a shifted column inequality (SCI). The set B is the bar of the SCI, and 

is the leader of (the bar of) the SCI. The set S is the shifted column (SC) 

of the SCI. See FigurelMfor examples. 

In particular, all column inequalities are shifted column inequalities. The 
class of shifted column inequalities, however, is substantially richer: It con- 
tains exponentially many inequalities (in q). 

Proposition 9. Shifted column inequalities are valid both for the packing 
orbitopes 0|^g((5q) and for the partitioning orbitopes O^q(Sg). 

Proof. As 0^g(©g) is a face of 0^g{&q), it is enough to prove the proposition 
for packing orbitopes 0^,j(6g). Therefore, let (i,j) = {rj,j) G 2p^g, with 
j > 2, and let S = {(1, ci), (2, C2), . . . , {r], Crj)} be a shifting of col(r/, j — 1). 
Denote by B the bar of the corresponding SCI. 

Let X £ {0, l}^P'i be a vertex of Of g{&q). If S n = 0, then clearly 
x{B) — x{S) = — x{S) < holds. Otherwise, there is a unique element 
ihj') = iv'ij') £ B (1 I^ . As j' > j, we have rj' < rj. Therefore <S" = 
{{l,ci),{2,C2),...,{r]',Cri')} C 5 is a shifting of col(?7',j' - 1). Thus, by 
Lemma [HI we have S' H I'-^ ^ 0. This shows x{S) > x{S') > 1, implying 
x{B) - x{S) < 1 - 1 = 0. □ 

4.3. A Linear Time Separation Algorithm for SCIs 

In order to devise an efficient separation algorithm for SCIs, we need 
a method to compute minimal shifted columns with respect to a given 
weight vector w G (Q-^P'?. The crucial observation is the following. Let 
S = {(1, ci), (2, C2), . . . , {1], Crj)} with 1 < ci < C2 < ■ ■ ■ < < j be a shift- 
ing of col{r],j) for {ri,j) € 2p,q with rj > I. If < j, then 5 is a shifting of 
col(r/,j — 1) {Case 1). If = j, then 

S - ivJ) = {(l,ci), (2,C2), . . . , (r? - l,c^-i)} 

is a shifting of col(r7 — [Case 2); see Figure [6l 

For all {r],j) € 2p^g, let uj{r],j) be the weight of a tii-minimal shifting of 
col{r],j). The table {uj{rj,j)) can be computed by dynamic programming as 
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follows; we also compute a table of values T{ri,j) € {1,2}, for each 
which are needed later to reconstruct the corresponding shifted columns: 

(1) For j = 1,2, . . . initialize io{l,j) := min{u;(^^) : £ G [j]}. 

(2) For 1] = 2,3, . . . ,p, initialize Lo{r], 1) := Lo{r] — 1,1)+ . 

(3) For 7] = 2,3, . . . ,p, j = 2,3, . . . ,q (with (r/, j) £ Ip^g) : Compute 

uJi :=uj{r],j -I) and uj2 ■= uj{r] - 1, j) + W(^^j'^ 
corresponding to Cases 1 and 2, respectively. Then set 



Thus, the tables {uj{r],j)) and {T{r],j)) can be computed in time 0{pq). 
Furthermore, for a given {r],j) £ Ip,g, we can compute a w-minimdl shifting 
S{r],j) of col(r/,j) in time 0{r]) from the table {T{r],j)): We have S{l,j) = 
{(1, j)} for all j G [q], S{r], 1) = col(?7, 1) for all rj G [p], and 



for all other {r],j). This proves the following result. 

Theorem 10. Let w G Q-^P'' be a given weight vector. There is an 0{pq) 
time algorithm that simultaneously computes the weights of w-minimal shift- 
ings of col{ri,j) for all {r],j) G 2p^q and a data structure that afterwards, for 
a given (rj,j), allows to determine a corresponding shifted column in 0{if) 
steps. 

In particular, we obtain the following: 

Corollary 11. The separation problem for shifted column inequalities can 
be solved in linear time 0{pq). 

Proof. Let a point x* G Q-^P'' be given. We can compute the x*-values 
(3{i,j) := x*{B{i,j)) of all bars B{i,j) = {i,j + 1), . . . , (i, min{i, g})} 

in linear time in the following way: First, we initialize j3{i,tj = x*^ for all 
i G \p] and i = mm{i,q}. Then, for each i G [p], we calculate the value 
Pihj) = x*j + P{i,j + 1) for j = mm{i,q} - l,min{i,g} - 2, . . . , 1. 

Using Theorem [10] (and the notations introduced in the paragraphs pre- 
ceeding it), we compute the table {io{r],j)) and the mentioned data struc- 
ture in time 0{pq). Then in time 0{pq) we check whether there exists an 
{i,]) = (??,j) G Tp^q with j >2 and uj{r],j — 1) < (3{i,j). If there exists such 
an (?7,j), we compute the corresponding shifted column S{T],j — 1) (in addi- 
tional time 0{r]) C 0{p)), yielding an SCI that is violated by x* . Otherwise 
X* satisfies all SCIs. □ 

Of course, the procedure described in the proof of the corollary can be 
modified to find a maximally violated SCI if x* does not satisfy all SCIs. 



'^(f/,j) 





S{r],j - 1) if r(r?,j) = 1 

S{r]-l,j)U{{v,j)} ifT(7?,j)=2 



4.4. Complete Inequality Descriptions 



In this section we prove that nonnegativity constraints, row-sum equa- 
tions, and SCIs suffice to describe partitioning and packing orbitopes for 
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symmetric groups. The proof will be somewhat more involved than in the 
case of cyclic groups. In particular, the coefficient matrices are not totally 
unimodular anymore. In order to see this, consider the three column in- 
equalities 

3^3,3 - X2,2 < 0, X4,3 + 2:4,4 - ^2,2 - ^3,2 < 0, and 

3^5,4 + a;5,5 - a;3,3 - 2:4,3 < 0. 

The submatrix of the coefficient matrix belonging to these three rows and 
the columns corresponding to (2,2), (3,3), and (4,3) is the matrix 

-1 +1 
-1 +1 
-1 -1 

whose determinant equals —2. Note that the above three inequalities define 
facets both of 0^,j(Sg) and ^(©g) forp>q>5 (see Propositions [T4l 
and [T5l respectively). 

Proposition 12. The partitioning orbitope Op g{&q) is completely described 
by the nonnegativity constraints, the row-sum equations, and the shifted col- 
umn inequalities: 

Opg{&q) = {x e U^"'" : x>0, a:(rowj) = 1 for i = I, . . . ,p, 

x{B) — x{S) < for all SCIs with SC S and bar B }. 

Proof. Let P be the polyhedron on the right-hand side of the statement 
above. Prom Propositions [7] and [9] we know already that 



p n = O-qi&q) n 'L^'^'" 



holds. Thus, it suffices to show that P is an integral polytope (as O^g(Sg) 
is by definition). In the following, we first describe the strategy of the proof. 

For the rest of the proof, fix an arbitrary vertex x* of P. A basis B of x* is a 
cardinality \1p^q\ subset of the constraints describing P that are satisfied with 
equality by x* with the property that the \1p,q\ x |Xp,g|-coefRcient matrix of 
the left-hand sides of the constraints in B is non-singular. Thus, the equation 
system obtained from the constraints in B has x* as its unique solution. 

We will show that there exists a basis B* of x* that does not contain 
any SCI. Thus, B* contains a subset of the p row-sum equations and at 
least —p nonnegativity constraints. This shows that x* has at most p 
nonzero entries and, since x* satisfies the row-sum equations, it has a nonzero 
entry in every row. Therefore, B* contains all p row-sum equations, and all p 
nonzero entries must in fact be 1. Hence, x* is a 0/1-point. So the existence 
of such a basis proves the proposition. 

The weight of a shifted column S = {(1, ci), (2, C2, ,)..., (r/, c,,)} with 
1 < ci < C2 < ■ ■ ■ < Crj < q (we will not need shifted columns with Cr^ = q 
here, as they do not appear in SCIs) is 

ri 

weight(S') ■=^ciq\ 

i=l 

In particular, if Si and S2 are two shifted columns with l^il < 1521, then 
we have weight(5i) < weight(S'2). The weight of an SCI is the weight of its 
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Figure 7: Illustration of trivial SCIs and of the three types of configurations not present 
in reduced bases of minimal weight, see Claim [S] Bars are shown in dark gray, shifted 
columns in light gray. Figure [(a)] shows trivial SCIs ("?" refers to a or 1). Figures (b) 
(c) and (d) refer to parts and ([3]) of Claim [3l respectively ("★" indicates any 

nonzero number). 



shifted column, and the weight of a basis B is the sum of the weights of the 
SCIs contained in B (note that a shifted column can appear in several SCIs). 

A basis of x* that contains all row-sum equations and all nonnegativ- 
ity constraints corresponding to 0-entries of x* is called reduced. As the 
coefficient vectors (of the left-hand sides) of these constraints are linearly 
independent, some reduced basis of x* exists. Hence, there is also a reduced 
basis B* of x* of minimal weight. 

To prove the proposition, it thus suffices to establish the following claim. 

Claim 1. A reduced basis of x* of minimal weight does not contain any SCI. 

The proof of Claim [T] consists of three parts: 

(1) We show that a reduced basis of x* does not contain any "trivial SCIs" 
(Claim [2|). 

(2) We prove that a reduced basis of x* of minimal weight satisfies three 
structural conditions on its (potential) SCIs (Claim [3]). 

(3) Finally, assuming that a reduced basis of x* with minimal weight con- 
tains at least one SCI, we will derive a contradiction by constructing a 
different solution x x* of the corresponding equation system. 

We are now ready to start with Part 1. We call an SCI with shifted 
column S trivial if x*{S) = holds or if we have x*{S) = 1 and x^^ = 
for all {k,£) £ S — for some (i,j) G S (thus satisfying x*j = 1) (see 

Figure [71[(4]). 

Claim 2. A reduced basis B of x* does not contain any trivial SCIs. 

Proof. Let 5 be the shifted column S and B be the bar of some SCI that is 
satisfied with equality by x*. 

If x*{S) = 0, then the coefficient vector of the SCI is a linear combination 
of the coefficient vectors of the inequalities Xij > for G S U B, which 
all are contained in B (due to x*{B) = x*{S) = 0). Since the coefficient 
vectors of the inequalities in B form a non-singular matrix, the SCI can not 
be in B. (By "coefficient vector" we always mean the vector formed by the 
coefficients of the left-hand side of a constraint.) 
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Figure 8: Illustration of the proof of Claim [31 parts ([T]) to ([3]). 



If S contains exactly one entry {k,£) G S with x^^ = 1, then we have 
x*{S) = x*{B) = 1. Let i be the index of the row that contains the bar B. 
The nonnegativity constraints Xrs > for (r, s) G S — {k,i), Xks > for 
{k, s) G rowfc — {k, i), and Xis > for {i, s) G rowj \B are contained in B. 

Since the coefficient vector of the considered SCI can hnearly be combined 
from the coefficient vectors of these nonnegativity constraints and of the row- 
sum equations x(rowfc) = 1 and x(rowj) = 1, this SCI cannot be contained 
in B. □ 

Claim 3. A minimal weight reduced basis B of x* satisfies the following three 
conditions: 

(1) // {k, i) is contained in the shifted column of some SCI in B, then there 
exists some s < i with x^,^ > 0. 

(2) // is the leader of an SCI in B, then x*- > holds. 

(3) // (i, j) is the leader of an SCI in B, then there is no SCI in B whose 
shifted column contains 



See Figure\% (h) - (d) for an illustration of the three conditions. 



Proof. Part (1): Assume there exists an SCI in B with shifted column S 
and bar B that contains the first nonzero entry of a row /c, i.e., there is 
{k,l) G S with xl^ > and xl^ = for all s < £. Let S' := S n T^-i,? 
be the entries of S above row k. Let C = {{k, 1), {k, 2), . . . ,{k,i — 1)} and 
B' = rowfc \(C + {k,£)). See Figure [H]p)]for an illustration. 

Because <S" is a shifting of col(A; — 1,£). 
and hence satisfied by x*. Since we have 



x{B') - x{S') < is an SCI 
|5'| < |5| (thus, weight(5') < 
weight(5)), it suffices to show that replacing the original SCI x{B)—x{S) < 
by x{B') — x{S') < gives another basis B' of x* (which also is reduced), 
contradicting the minimality of the weight of B. 

Due to x*(rowfc) = 1, x*{C) = 0, x*{B') - x*{S') < 0, and S' + {k, i) C S 
we have 



1 



Xi,p -\- X 



(B') 



^ X ; < xl, + x*{S') < x*{S) = x^{B) < 1. (7) 

Therefore, equality must hold throughout this chain. In particular, this 
shows x*{B') — x*{S') = 0. Thus, its suffices to show that the coefficient 
matrix of the equation system obtained from B' is non-singular, which can 
be seen as follows. 
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Since x*{S' + {k,£)) = 1 = x*{S) (see we know that all nonnegativity 
constraints Xrs > with (r, s) € 5 \ (5" + {k, i)) are contained in B and B'. 
The same holds for x^s > with {k, s) € C and for Xis > with {i, s) S 
rowj \B, where row i contains bar B (since x*{B) = 1 by ([7])). Thus, we can 
linearly combine the coefficient vector of x{B) — x{S) < from the coefficient 
vectors of the constraints x{B') — x{S') < 0, x(rowfc) = 1, x(rowj) = 1, and 
the nonnegativity constraints mentioned above. Since all these constraints 
are contained in B', this shows that the coefficient matrix of B' has the same 
row-span as that of 5, thus proving that it is non-singular as well. 

Part (2): Assume that there exists an SCI in B with leader (i, j), bar B, 
and shifted column S such that x*^ = 0. If 5 = {(1, ci), (2, C2), . . . , (r/, c,,)}, 
then we have (i,j) = (?7,i). Define B' := B — S' := S — (r/, c,,), and 

observe that B' / 0, S" / 0, i.e., \B\ > 1 and |5| > 1, because a reduced 
basis does not contain trivial SCIs by Claim [2l see Figure [H] |(2)[ Hence, 
x{B') — x{S') < is an SCI. We therefore have: 

= x*{B) - x*{S) = x\B') - x^S) < x*{B') - x*{S') < 0, (8) 

where the first equation holds because x{B) — x{S) < is satisfied with 
equality by x* and the second equation follows from x*j = 0. Hence, we 
know that x*{B') — x*{S') = 0. Since we have < \S\ (and consequently 
weight(5') < weight(iS)), again it remains to show that the coefficient vector 
of x{B) — x{S) < can be linearly combined from the coefficient vector of 
x{B') — x{S') < and some coefficient vectors of nonnegativity constraints 
in B and B'. But this is clear, as we have xt^ = and x'l \ = 0, where the 
latter follows from ([8]). 

Part (3): Assume that in B there exists an SCI 

x(Bi) -x(5i) < (9) 
with leader (z,j) = (77, j), bar Bi, and shifted column 
5*1 = {(l,ci), (2,C2), . . . , (??,c^)} 
(in particular: < j) and another SCI 

X(B2) -x(52) < (10) 
with bar B2 and shifted column 

5*2 = {(1, di), (2, d2),..., ivJ), iv + 1, dri+i), . . . , (r, d^)}. 
Hence, we have = (Vij) ^ 'S'2- Define 

53:={(l,di),(2,d2),...,(7?-l,Vl)} 

(i.e, the part of ^2 lying strictly above row i) and 

S4, ■■= {(l,ci),. . . , {r],Cn), (?? + l,dri+i), • • • , (r, dr)} 

(i.e. Si together with the part of ^2 strictly below row i). Clearly, 5*3 is a 
shifting of col(r/ — = col(i — and ^4 is a shifted column as well 

(due to < j < drj+i). Thus, with B3 = Bi — we obtain the SCIs 

x{B3) - xiSs) < (11) 

x{B2) - x{S4) < (12) 

(see Figure E\(M- 
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Since ([9]) and ([TO]) are contained in B, we have x*{Bi) — x*{Si) = and 
x*{B2) — x*{S2) = 0. Adding these two equations yields 

{x*{B3) - x^iSs)) + {x*iB2) - x*(54)) = 0, (13) 

because x*j cancels due to G i?i H S2. Since x* satisfies the SCIs ([TTI) 
and (fT2l) . Equation (fT3]) shows that in fact we have x*(i?3) — x*{S3) = and 
x*(S2) - x*iSi) = 0. 

It is not clear, however, that we can simply replace ([9|) and (fTO]) by (fTTI) 
and (fT2]) in order to obtain a new basis of x*. Nevertheless, if vi,V2,V3, 
and t;4 are the coefficient vectors of ([9]), (fTO]l . ([TT]) . and (fT2l) . respectively, 
we have t^i + 1^2 = ^^3 + "^4 , which implies 

V2 = V3 + V4, - vi. (14) 

Let V C IR,'^''''' be the subspace of IR,-^P'« that is spanned by the coefficient 
vectors of the constraints different from (fTO]) in B. Thus, the linear span 
of F U {^2} is the whole space R^p-". Due to (HH), the same holds for 
VU{v3,V4} (since vi G V). Therefore, there is a € {3,4} such that V[J{va} 
spans R-^P'''. Let (a) be the corresponding SCI from { ([TT]) . (fT2]) }. Hence, 
B' :=B\ {(Uni)} U {(a)} is a (reduced) basis of x* as well. 

Since we have {S-sl < \S2\ and weight(<S'4) < weight(52) (due to Crj < j), 
the weight of B' is smaller than that of B, contradicting the minimality of 
the weight of B. □ 

Before we finish the proof of the proposition by establishing Claim [H we 
need one more structural result on the SCIs in a reduced basis of x*. Let 
S = {(1, ci), (2, C2), . . . , (r/, c^)} be any shifted column with x*^^ ^ > for 
some 7 G [t]]. We call (7,0^) the first nonzero element of S if 

^a,ci> = • • • = a:*7-i,c^_i) = 
holds. Similarly, (7,0^) is called the last nonzero element of S if we have 

Claim 4. Let B be a reduced basis of x* , and let Si, S2 be the shifted columns 
of some SCIs in B (Si = S2 is allowed). 

(1) If is the first nonzero element of Si and G «S'2, then {i,j) is 
also the first nonzero element of S2. 

(2) If is the last nonzero element of Si with x*{Si) = 1 and G S2, 
then is also the last nonzero element of S2 and x*{S2) = 1. 

(3) If {i,j) is the last nonzero element of Si with x*{Si) = 1, then is 
not the first nonzero element of S2. 

Proof. Let 

5i = {(l,ci), (2,C2), . . . , (?7,Cr,)} and ^2 = {(1, di), (2, ^2), . . . , (r, d^)} 

be two shifted columns of SCIs with bars Bi and -B2, respectively, in the 
reduced basis B of x*. Suppose that (i, j) = (7, j) G Sir\S2, i.e., = j = 
holds. Define 

S'l := {(l,ci), (2,C2), . . . , (7 - l,c^-i)}, 
5^={(l,(ii), (2,^2),..., (7 -1,^7-1)}' 
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Figure 9: Illustration of sets used in the proof of Claim [D 

and ^2 := ^2 \ ^2, see Figure [H Since (7,i) € S*! n 52 holds, S[ U 5*2 is a 
shifted column and x{B2) — x{S[ U 6*2) < is an SCI. Thus, we obtain 

x%B2) - x*{S[) - x*{S'2) < 0. (15) 

Furthermore, since x{B2) — x{S2) < is contained in the basis B of x*, we 
have 

x*{B2)-x^{S'2)-x^{S'2)=^. (16) 
Subtracting (fTHj) from (fTSj) yields x*{S2) — x*{S[) < 0. We thus conclude 

x*{S'2) < x*{S[) and x*{S[) < x*(5^) (17) 

(where the second inequality follows by exchanging the roles of Si and ^2 in 
the argument). 

Part (1): If (i, j) is the first nonzero element of 5i, then we have x*{S'i) = 0. 
Thus, the first inequality of (fT7|) implies x*(52) = 0, showing that (z,j) is 
the first nonzero element of 82- 

Part (2): If (i,j) is the last nonzero element of Si and x*{Si) = 1 holds, 
then we have x*[S'i + = 1. With the second inequality of (fT7|l we 

obtain: 

1 = x*{S'i + < x'iS'2 + (i, j)) < ^^(^2) = x*iB2) < 1, 

where the last equation holds because x(i?2) — x{S2) < is contained in B. 
It follows that x*{S2) = 1 and is the last nonzero element of 82- 

Part (3): This follows from the first two parts of the claim, since B does not 
contain any trivial SCIs by Claim [2 □ 

We will now proceed with the proof of Claim [D Thus, assume that B* is a 
reduced basis of x* of minimal weight and suppose that B* contains at least 
one SCI. We are going to construct a point x ^ x* that satisfies the equation 
system obtained from contradicting the fact the x* is the unique solution 
to this system of equations. 

At the beginning, we set x = x* , and let A > be an arbitrary posi- 
tive number. Then we perform the following four steps (see Figure [TO] for 
illustrations of the first three). 

(1) For every (i,j) that is the first nonzero element of the shifted column of 
at least one SCI in B* , we reduce xij by A. 

(2) For every (i,j) that is the last nonzero element of the shifted column S 
of at least one SCI in B* with x*{S) = 1, we increase Xij by A. 
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(a) (b) (c) 

Figure 10: Illustration of the construction of x, Steps ([T]) to ((Sj. 



(3) For each i G [p\ and for all j = min{i, q}, min{i, q\ — 1, . . . ^1 (in this 
order): If is the leader of some SCI in B* , we adjust Xij such that, 
with B = {{ij), + 1),. . . , {i,mm{i,q})}, 



3:{B) 



1 iix''{B) = l 

x*{B) — X otherwise 

holds. 

(4) For each i € [p], adjust Xij in order to achieve a;(rowj) = 1, where 
j = min{^ : x*^ > 0}. 

The reason for treating the case x*{S) = 1 separately in Step[2]will become 
evident in the proof of Claim [8] below. 

The following four claims will yield that x is a solution of the equation 
system corresponding to B*. 

Claim 5. After Step for each shifted column S of some SCI in B* we 
have 

ifx\S) = l 
x*{S) — A otherwise. 



x{S) 



Proof. Let S be the shifted column of some SCI in B*. It follows from 
Part ([T]) of Claim m that the first nonzero element (i,j) of 5 is the only 
element in S whose x-component is changed (reduced by A) in SteplH Thus, 
after Step [H we have x{S) = x*{S) — A. 

If x*(5) < 1, then, by Part © of ClaimH x{S) is not changed in StepE 
Otherwise, x*{S) = 1, and Xki is increased by A in Step O where {k,i) is 
the last nonzero element of S. According to Part ^ of Claim [H no other 
component of x belonging to some element in S is changed in Step [21 Thus, 
in both cases the claim holds. □ 

Claim 6. No component of x belonging to the shifted column of some SCI 
in B* is changed in Step 0. 

Proof. Let S be the shifted column of some SCI in B*. According to Part dH]) 
of Claim [31 S does not contain the leader of any SCI in B* , since B* is a 
reduced basis of minimal weight. □ 

Claim 7. After Step{^ for each SCI in B* with shifted column S and bar B 
we have x{S) = x{B). 
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Proof. For an SCI in B* with shifted column S and bar B, we have x*{S) = 
x*{B). Thus, from Claims [5] and [6] it follows that x{S) = x{B) holds after 



Claim 8. Step [7] does not change any component of x that belongs to the 
shifted column or the bar of some SCI in B* . 

Proof. Let be such that x*^ = for all (. < j and x\- > 0. By Part ([I|) 
of Claim [31 is not contained in any shifted column of an SCI in B*. If 

is contained in the bar B of some SCI in B*, then clearly x*{B) = 1 
holds. Thus, after Step [U we have x(rowj) = x{B) = 1, which shows that 
Xij is not changed in Step [H □ 

We can now finish the proof of the proposition. Claims [7] and [8] show 
that X satisfies all SCIs contained in B* with equality. Furthermore, in all 
steps of the procedure only components Xij with x*j > are changed (this 
is clear for Steps [HO and[H for Step [3| it follows from Part ([2]) of Claim [3]). 
Since after Step [D, x satisfies all row-sum equations, this proves that x is a 
solution to the equation system obtained from B*. 

We assumed that B* contains at least one SCI. Let S be the shifted column 
of one of these. We know x*{S) > by Claim [2l Thus, let (i, j) be the first 
nonzero element of S". Hence, after Step[H we have -A. By Part © 

of Claim [U this still holds after Step [2 As Xij is also not changed in Steps [3] 
and [H (see Claims [6] and [8]) , we deduce x ^ x*, contradicting the fact that x* 
is the unique solution to the equation system belonging to B*. 

This concludes the proof of Proposition [121 D 

We hope that reading this proof was somewhat enjoyable. Anyway, at least 
it also gives us a linear description of the packing orbitopes for symmetric 
groups almost for free. 

Proposition 13. The packing orbitope 0'^g{6q) is completely described by 
the nonnegativity constraints, the row-sum inequalities, and the shifted col- 
umn inequalities: 

OlqC®?) = {xe n^"-" : X > 0, x(rowi) < 1 for i = 1, . . . ,p, 

x{B) - x{S) < for all SCIs with SC S and bar B }. 

Proof. Let Q C M!^^''' be the polyhedron on the right-hand side of the state- 
ment. We define A := {x e R^p+i.-j+i ; x(rowj) = 1 for all i £ [p-\-l]}. 

The proof of Proposition [121 in fact shows that its statement remains true 
if we drop all SCIs with shifted column S and S n coli / from the linear 
description. This follows from the fact that, due to x^i = 1 and Claim [21 no 
such SCI can be contained in any reduced basis of x* (using the notations 
from the proof of Proposition [T^ . Thus we obtain 



Step [31 



□ 



O: 



(18) 
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with 

Q = {xe ]R,^p+i.9+i : x{B) - x{S) < for all SCIs with bar B 
and shifted column 5 with S (1 coli = 0, 
Xij > for all € 2p+i,g+i \ coli, 
x(rowj —{i, 1)) < 1 for all i = 2, . . . ,p + 1}, 

where the last inequalities are equivalent (with respect to 0p_^i^q_^_i{6q+i)) to 
the nonnegativity constraints associated with the elements of coli by addition 
of row-sum equations. 

Define C := {x £ ]R,^p+i.9+i : xn = for alH G [p + 1]}, and denote by 
TT : ]R^p+i.9+i H the orthogonal projection. Since none of the inequalities 
defining Q has a nonzero coefficient in coli, we have vf~^((5 n £) = Q, hence 
Qn£ = 7r(Q). This yields 7f(^nQ) = 7f(^)n7f((5), which, due to 7r(^) = £, 
implies t:{^A n Q) = Q fl £. Thus, we obtain 

o|g(6,) = *(0=+i,,+i(6,+i)) = n g) = Q n £ = Q, 

where the first equation is due to Proposition [5l the second equation follows 
from (jlSp . and the final arises from identifying C, with IR-^P'?. □ 

4.5. Facets 

In this section, we investigate which of the constraints from the linear 
descriptions of O^g(Sq) and O^g(Sg) given in Propositions [T2] and [T3l re- 
spectively, define facets. This will also yield non-redundant descriptions. 

It seems to be more convenient to settle the packing case first and then 
to carry over the results to the partitioning case. Recall that we assume 
2 < p < (?. 

Proposition 14. 

(1) The packing orbitope 0'^g{6q) C IR,'^''''' is full dimensional: 

dim(0|,(6,)) = |J,,,| = - ^ = - 

(2) A nonnegativity constraint Xij > 0, € Tp^q, defines a facet of 
0^q{6q), unless i = j < q holds. The faces defined by xjj > with 
j < q are contained in the facet defined by Xqg > 0. 

(3) Every row-sum constraint 3;(rowj) < 1 for i G [p\ defines a facet of 

o|,(e,). 

(4) A shifted column inequality x{B) — x{S) < with bar B and shifted 
column S = {{l,ci), {2, C2), {rjjCff)} defines a facet of 0^q{&q) , un- 
less Tj > 2 and ci < C2 (exception I) or rj = 1 and B ^ {(l,ci + 1)} 
(exception II) hold. In case of exception I, the corresponding face is 
contained in the facet defined by the SCI with bar B and shifted column 
{(1, C2), (2, C2), . . . , [t], Crf)}. In case of exception II, the face is contained 
in the facet defined by the 5C/ x^i ^+1) — a^(i,ci) ^ 0. 

Proof Part (1): For all {kj) G 2p,g, we define V^'^ = (uf/) G R^P'" by 

ki Jl if = J < ^ and j < or (i,j) = (/c,£) /• •\^-t 

I (J otherwise 
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(a) matrix V*" 



(b) matrix V'' 



c 


2 
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Figure 11: (a) {(b) 



of Proposition ll^l " (c 



Illustration of the matrices used in the proof of parts ^ and ((Sj 
Example of an SCI that does not define a facet; see the proof of 



Part (m of Proposition [TH 



that is, V has 1-entries at position {k,£) and on the main diagonal up to 



column i, except that v^j^ = unless {k,£) = {q,q); see Figure [TT] (a). The 
columns of each V^^ are in non-increasing lexicographic order. Hence, by 
Part ([1]) of Observation [U each V^^ is a vertex of O^g(Sg). 

In order to show that these vectors are linearly independent, we fix an 
arbitrary ordering of the V^^ that starts with F^^, F^^, . . . , For 
each {k,£) G 1p,g, all points V^^ preceding V^^ have a 0-entry at position 
{k,i), while v^^ = 1. This shows that these \Ip,q\ vertices of 0^g{&q) are 
linearly independent. Together with this gives |Xp^q|-|-l afhnely independent 
points contained in O^g(Sq), proving that O^q(Sg) is full dimensional. The 
calculations in the statement are straightforward. 

Part (2): For G 2p,g \ : j < q} all points V''^ with {k,i) / {ij) 

are contained in the face defined by Xij > 0. Since this is also true for 0, the 
face defined by Xij > contains \1p,q\ afhnely independent points (see the 
proof of Part ([I])), i.e., it is a facet of 0^g{&q). 

For every vertex x* G 0^g{&q) contained in the face defined by Xjj > 
for some j < q, we have x*^g = for all l> j (because otherwise the columns 
of X* would not be in non-increasing lexicographic order). This shows that x* 
is contained in the facet defined by Xqq > 0. 

Part (3): In order to show that x(rowj) < 1 defines a facet of O^g(Sg) 
for i G [p], we construct points V^^ (depending on i) from the points V^^ 
defined in Part by adding a 1 at position (i, 1) if F'^^(rowj) = (see 
Figure [II][(b)]). The (|2p,g| - 1) points V^^ for all {k,l) G Zp^q - {i, 1), and 
the unit vector E^^ (with a single 1 in position satisfy x(rowj) = 1. 

Furthermore, they are affinely independent, since subtracting E^^ from all 
vectors V''^ yields vectors V'^^, which can be shown to be linearly independent 
similarly to Part ([1]); here, we need {k,i) ^ [i, 1). 

Part (4): Let x{B) — x{S) < be an SCI with bar B, leader (i,j) = {r],j), 
and shifted column S = {(1, ci), (2, C2), . . . , (?/, c,,)}. 

If r] > 2 and ci < C2 hold (exception I), then the SCI is the sum of the 
SCI 

X{l,c:,+1) - a;(i,ci> < 
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(a) All cases (b) Case A, (c) Case D, (d) Case S 

Figure 12: Illustration of the constructions in the proof of Part Q of Proposition 1141 



and the SCI with bar B and shifted column {(1, ci + 1), (2, C2), . . . , (r/, c^)}; 
see Figure [TT] (c) Repeating this argument (c2 — ci — 1) times proves the 
second statement of Part (j4]) for exception I. 

If = 1 and B = with j > ci + 1 hold (exception II), then the 

SCI is the sum of the SCIs x^i ^i+i) ~ ^(i,ci) • • • ) ^{i,i> ~ ^ 0- 

This proves the second statement of Part (|4} for exception II. 

Otherwise, let V be the set of vertices of O^g(Sg) that satisfy the SCI 
with equality, and let C = lin(V U {-B*-'}) be the linear span of V and the 
unit vector E'^K We will show that C = IR,-^P'«, which proves dim(aff(V)) = 
\1p,q\ — 1 (since € V). Hence, the SCI defines a facet of (Sg). 



To show that C = IR^P'? , we prove that i?^* € C for all (r, s) € Xj 



partition the set Zp^q \{B U S) into three parts (see Figure [12] (a) ): 

A ■.={{p, s) £ Ip^g ■■ {p<V and s < Cp) 01 p > r]}, 
-{{p, s) = (r, s) G Ip^g ■ p < V and r > i}, and 
--{{p, s) = (r, s) G Ip^g : p < rj, s > Cp, and r < i}. 



We 



C 
D 



For (r,s) = denote by diag-(r,s) = {(p, 1), (p, 2), . . . , (p, s)} the 

diagonal starting at (p, 1) = (r — s + 1,1) and ending at {p,s) = {r,s). 
Similarly, denote by diag-(r, s) = {{p, s) , {p, s + 1) , . . .} D Jp^q the diagonal 
starting at (r, s) and ending in colg or in rowp. 



Claim 9. For all (r, s) = {p,s) e AuC we have E''' G C. 



Proof. Denote the incidence vector of diag- 



by W 



^diag^(r,s) ^ 



see 



Figure [T2|[(b)]). Both and - E""' are vertices of Of g{&q). We have 
diag- (r, s) r\{B\J S) = for (r, s) G A. Furthermore 

|diag-(r, s) n S| = 1 = |diag-(r, s) n 5| 

for (r, s) G C . Hence, these two vertices satisfy the SCI with equality and 
we obtain = - {W' - E"-') G £. □ 

Claim 10. For all (r, s) = {p, s) e D we have G C. 

Proof. Define the set 

U{r, s) := diag^(r, s) U diag^(r + I, s) U {{{p + I, q), {p + 2, q), . . .} n Ip,q), 
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see Figure [r2l|(c)[ Let U^^ := x^^^'^^- By construction, the three points U^^ , 

jjrs _ ^rs^ g^^^ jjrs _ ^r+l,s vertices of 0|g(eg). 

If p = 1, we have \U{r,s) r\B\ = l and \U{r,s) nS'j = 1, where we need 
ci = C2 in case of s = ci + 1 (notice that in case of = 1 we have D = 0). 
Due to (r, s) ^ S U 5, both U''' and W - E''' satisfy the SCI with equahty. 
This yields E""' = - {V' - E''') G C. 

If p > 1, then \U{r, s) n S\ = 1 does not hold in all cases (e.g., if s = Cp+i, 
we have (r + 1, s) € S). However, since p > 1, C/(r — 1, s) is well-defined and 

\U{r-l,s)nB\ = l and \U{r - 1, s) n S\ = 1 

hold. Hence the vertices U"^"^'^ and U^~^''^ — E"^^ satisfy the SCI with equal- 
ity, giving E"-' = V-^^' - (C/^-i'^ - E"-') G £. □ 

Claim 11. For all (r, s) = (p, s) e S we have E^'^ G C. 

Proof. Define the set 

r(r,s) := diag^(r + j - s,i) U ({(p + 1, j), (p + 2, j), . . . } nZp,g), 

see Figure ll2j(d)[ The incidence vector T*"** := X"^^^'*^ is a vertex of 0^,j(©g), 
which, due to T(r, s) n 5" = {(r, s)} and T(r, s) n i? = {(i, j)} satisfies the 
SCI with equality. Thus, from 
^rs ^ j^rs _ ^ij _ ^ j^ki _ ^ j^ke _ ^ ^ke 

(k,e)eT(r,s)nA {k,£)&T{r,s)nC (k,e)£T{r,s)nD 

we conclude E^'^ G £, since E^^ G £ by definition of >C, and E''^ G £ for all 
(A;, G AUCUL» by Claims Hand [101 □ 

Claim 12. For all {i,s) = (p, s) e B we have E''' G £. 

Proof. The vector VF*'^ := x'^^^^^^'^'^^ is a vertex of O^g(Sg) that satisfies the 
SCI with equality. Furthermore, we have 

^is ^ y^^is _ ^rcp _ ^ ^k£ _ ^ 

(A:,^)ediag^ {i,s)nA {fc,^)Gdiag^ (i,s)nD 

where (r, Cp) := {p,Cp) G S. Thus, we conclude E'** G £, since E*^^ G £ for 
all (fc,£) G AU£»U5 by ClaimsEmni anddB □ 

Claims El to [I2] show E''' G C for all (r, s) G 2p,g. This proves that the SCI 
defines a facet of 0^y{&q) (unless exception I or II hold). □ 

Finally, we carry the results of Proposition [Ml over to partitioning or- 
bitopes. 

Proposition 15. 

(1) The partitioning orbitope 0^g(6g) C 1R!^p-'' has dimension 

dim(0=g(©g)) = lXp-i,g-i| = \Ip,q\ -P = {p - l){q- !)• 

The constraints a;(rowi) = 1 form a complete and non-redundant linear 
description of aS{0~q{&q)) . 

(2) A nonnegativity constraint Xij > 0, {i,j) G ^p,g, defines a facet of 
0~q{&q), unless i = j < q holds. The faces defined by xjj > with 
j < q are contained in the facet defined by Xqq > 0. 
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(3) A shifted column inequality x{B) — x{S) < with bar B and shifted co/- 
Mmn 5 = {(1, ci), (2, C2), ...,(??, c^)} defines a facet of 0~q{&q) , unless 
ci = 1 (Exception I) or rj > 2 and ci < C2 (Exception II) or r] = 1 
and B ^ {(l,ci + 1)} (Exception III). In case of Exception I, the cor- 
responding face is contained in the facet defined by xn > 0, where i is 
the index of the row containing B. In case of Exception II, the face 
is contained in the facet defined by the SCI with bar B and shifted col- 
umn {{1,02), (2,02), {i],Crj)}. In case of Exception III, the face is 
contained in the facet defined by the SCI X(^i^c. 1+1) ~ ^(i,ci) < 0. 

Proof. According to Proposition [5l 0^_^ ^_^(Sq_i) is isomorphic to 0p g{6q) 
via the orthogonal projection of the latter polytope to the space 

C:={x e E,^"'" : Xii=0 for all i e [p]} 

(and via the canonical identification of C and R-^p^^ ^^^). This shows the 
statement on the dimension of 0^g(©g); the calculations and the claim on 
the non-redundancy of the equation system are straightforward. 

Furthermore, this projection (which is one-to-one on aff (O^g(Sq))) maps 
every face of O^g(Sg) that is defined by some inequality 

{a,x) := ^ OijXij < ao, 

with a € IR,-^f'9, S R, and an = for all i G [p] to a face of 0^_^ q_^{&q-i) 
of the same dimension defined by 

Conversely, if (a, x) < do defines a face of 0^_-^ ^_-^{&q-i) for a E IR^p-i.-j-i 
and do S R, then the inequality 

djj Xj+i j+i < do 

(*j)G2:p,9 

defines a face of 0^g(©g) of the same dimension. 

Due to parts (0) and ^ of Proposition [HI this proves Part ^ of the 
proposition, where we use the fact that the inequalities xn > are equivalent 
to 2;( rowj — (i, 1)) < 1 with respect to O^g(Sq). 

Furthermore, due to Part ^ of Proposition [TU the above arguments also 
imply the statements of Part ([3]) for ci > 2 (including Exception II and III). 
Finally, we consider the case ci = 1 (Exception I). Since we have xi^i = 1 
for all X G O^g(Sg), the equation x{B) — x{S) = implies 

1 > x{B) = x{S) > = 1, 

and hence Xi^i = (using the row-sum equation for row i containing B). 
This concludes the proof. □ 

4.6. Summary of Results on the Symmetric Group 

We collect the results on the packing- and partitioning orbitopes for sym- 
metric groups. 
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Theorem 16. The partitioning orbitope O^q(Sg) {for p > q > 2) with 
respect to the symmetric group &q equals the set of all x G R-^P'' that satisfy 
the following linear constraints: 

o the row-sum equations x{vowi) = 1 for all i G [p], 

o the nonnegativity constraints Xij > for all {i,j) € Ip^g\{{j, j) : j < q}, 
o the shifted column inequalities x{B) — x{S) < for all bars 

B = {(i, j), + 1),. . . , (i,min{i,g})} 

with = {Ti,j) G ^p,q, j ^ 2, and shifted columns 

S = {(l,Ci), (2,C2), . . . , {r],Cri)} 2 < Ci = C2 < • • • < < i - 1, 

where in case of i] = 1 the last condition reduces to 2 < ci and we 
additionally require j = ci + 1. 

This system of constraints is non-redundant. The corresponding separation 
problem can be solved in time 0{pq). 

For the result on the completeness of the description, see Proposition [121 
for the question of redundancy see Proposition [15], and for the separation 
algorithm see Corollary [TTl Note that the SCI with shifted column {(1,1)} 
and bar {(2,2)} defines the same facet of O^q(Sq) as the nonnegativity 
constraint X2,i > 0. 

Theorem 17. The packing orbitope 0'^g{&q) {for p > q > 2) with respect 
to the symmetric group &q equals the set of all x € IR?'^-'' that satisfy the 
following linear constraints: 

o the row-sum inequalities x(rowj) < 1 for all i G [p], 

o the nonnegativity constraints Xij > for all {i,j) G 2p^q\{{j,j) : j < q}, 
o the shifted column inequalities x{B) — x{S) < for all bars 

B = {{i,j),{hj + I),-- - ,(«,niin{i,g})} 

with {i,j) = {Ti,j) G ^p,qj j ^ 2, and shifted columns 

S = {(l,ci), (2,C2), . . . , {r],Cn)} with ci = C2 < • • • < < j - 1, 

where in case of rj = 1 we additionally require j = ci + 1. 

This system of constraints is non-redundant. The corresponding separation 
problem can be solved in time 0(jpq). 

For the result on the completeness of the description, see Proposition [131 
for the question of redundancy see Proposition [T31 and for the separation 
algorithm see Corollary [TTl 

5. Concluding Remarks 

We close with some remarks on the technique used in the proof of Propo- 
sition [121 oil the combination of SCIs and clique-inequalities for the graph- 
coloring problem, and on full and covering orbitopes. 
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Figure 13: Combination of a clique inequality and an SCI. 



The Proof Technique. Our technique to prove Proposition [12] can be sum- 
marized as follows. Assume a polytope Q C R" is described by some (finite) 
system Q of linear equations and inequalities. Suppose that Q' is a subsys- 
tem of Q for which it is known that Q' defines an integral polytope Q' ^ Q. 
One can prove that Q is integral by showing that every vertex x* of Q is a 
vertex of Q' in the following way. Here we call a basis (with respect to Q) 
of X* reduced if it contains as many constraints from Q' as possible: 

(1) Starting from an arbitrary reduced basis ;B of x*, construct iteratively a 
reduced basis B* of x* that satisfies some properties that are useful for 
the second step. 

(2) Under the assumption that B* ^ Q', modify x* to some x ^ x* that 
also satisfies the equation system corresponding to B* (contradicting the 
fact that B* is a basis). 

(In our proof of Proposition [121 Step ^ was done by showing that a reduced 
basis of "minimal weight" has the desired properties.) 

Such a proof is conceivable for every 0/1-polytope Q by choosing Q' = 
[0, 1]" as the whole 0/1-cube and Q' as the set of the 2n trivial inequalities 
< < 1, for z = 1, . . . , n (if necessary, modifying Q in order to contain 
them all). 

We do not know whether this kind of integrality proof has been used in 
the literature. It may well be that one can interpret some of the classical 
integrality proofs in this setting. Anyway, it seems to us that the technique 
might be useful for other polytopes as well. 

The Graph- Coloring Problem. As mentioned in the introduction, for con- 
crete applications like the graph coloring problem one can (and probably 
has to) combine the polyhedral knowledge on orbitopes with the knowledge 
on problem specific polyhedra. We illustrate this by the example of clique 
inequalities for the graph coloring model ([T]) described in the introduction. 

Fix a color index j £ [C]. If W V is a clique in the graph G = {V, E), 
then clearly the inequality ^i^y/ Xij < 1 is valid. In fact, the strengthened 
inequalities X^ieVF — known to be facet-defining for the convex 

hull of the solutions to ([I]), see [Ij. Suppose that S C ^\v\,c is a shifted 
column and that we have rj < \S\ for all {r],j) = with i G W. Then the 
inequality 

Xij — x{s) < 



PACKING AND PARTITIONING ORBITOPES 



31 



is valid for all solutions to the model obtained from ([T]) by adding inequali- 
ties ([2]) (which are all "column inequalities" in terms of orbitopes), see Fig- 
ure [131 The details and a computational study will be the subject of a 
follow-up paper. 

Full and Covering Orbitopes. As soon as one starts to consider 0/1-matrices 
that may have more than one 1-entry per row, things seem to become more 
complicated. 

With respect to cyclic group actions, we loose the simplicity of the char- 
acterizations in Observation [H The reason is that the matrices under inves- 
tigation may have several equal nonzero columns. In particular, the lexico- 
graphically maximal column may not be unique. 

With respect to the action of the symmetric group, we still have the char- 
acterization of the representatives as the matrices whose columns are in 
non-increasing lexicographic order (see Part [T] of Observation [T|) . The struc- 
tures of the respective full and covering orbitopes, however, become much 
more complicated. In particular, we know from computer experiments that 
several powers of two arise as coefficients in the facet-defining inequalities. 
This increase in complexity is reflected by the fact that optimization of linear 
functionals over these orbitopes seems to be more difficult than over packing 
and partitioning orbitopes (see the remarks at the end of Section [2]T]) . 

Let us close with a comment on our choice of the set of representatives 
as the maximal elements with respect to a lexicographic ordering (referring 
to the row- wise ordering of the components of the matrices). It might be 
that the difficulties for full and covering orbitopes mentioned in the previous 
paragraph can be overcome by the choice of a different system of representa- 
tives. The choice of representatives considered in this paper, however, seems 
to be appropriate for the packing and partitioning cases. 

Whether the results presented in this paper are useful in practice will turn 
out in the future. In any case, we hope that the reader shares our view that 
orbitopes are neat mathematical objects. It seems that symmetry strikes 
back by its own beauty, even when mathematicians start to fight it. 
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